AUTHENTICATION APPARATUS 



[0001] This patent application claims priorities from 

Japanese patent applications No. 2003-028180 filed on February 
5, 2003 and No. 2003-384258 filed on November 13, 2003, the 
contents of which are incorporated herein by reference. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

[0002] The present invention relates to an authentication 

apparatus. More particularly, the present invention relates 
to the authentication apparatus for improving user's 
authentication precision . 

Description of the Related Art 

[0003] In order to manage entrance management or secret 

information, an authentication process may be performed for a 
user. The user's authentication process includes a method of 
using portable recording media holding an authentication key 
and a method of using a head shot and/or a fingerprint etc. For 
details, refer to, for example, Japanese Patent Laid-Open No. 
2002-92495 and Japanese Patent Laid-Open No. 2001-326841. 
[0004] When the user performs the personal authentication 

using the head shot, it is preferable to accord a countenance 
of the user during the authentication process with a countenance 
of a head shot of the user held in advance in order to certify 
the user with higher precision. 



SUMMARY OF THE INVENTION 
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[0005] Therefore, it is an object of the present invention 

to provide an authentication apparatus which can solve the 
foregoing problem. The above and other objects can be achieved 
by combinations described in the independent claims. The 
dependent claims define further advantageous and exemplary 
combinations of the present invention . 

[0006] According to the first aspect of the present 

invention, there is provided an authentication apparatus for 
certifying a user using a picture registered in a memory. The 
authentication apparatus comprises: an image pickup unit for 
taking the picture of the user; and an authentication unit for 
certifying the user by letting the user to bring into a 
predetermined action and comparing the user's picture taken by 
the image pickup unit when the user brings into the predetermined 
action and the picture registered in the memory . 
[0007] In the authentication apparatus, the image pickup 

unit may take an animated image of the user, the memory may supply 
an animated image of the user who brings into the predetermined 
action to the authentication unit, the authentication unit may 
certify the user by comparing the user' s movement in the animated 
image taken by the image pickup unit and the user' s movement 
in the animated image acquired from the memory . 
[0008] In the authentication apparatus, the image pickup 

unit may take an animated image of head shots of the user, the 
memory may supply an animated image of head shots of the user 
to the authentication unit, the authentication unit may certify 
the user by comparing a movement of the head shots taken by the 
image pickup unit and a movement of the head shots acquired from 
the memory. 

[0009] The authentication apparatus may further comprise 

a registration information acquiring unit for acquiring a voice 
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of the user and the head shot of the user corresponding to the 
voice from the memory, and the image pickup unit may take the 
head shots of the user, the authentication unit may certify the 
user by comparing the head shots of the user taken by the image 
pickup unit and the head shots acquired from the memory when 
a degree of coincidence between a voice newly uttered by the 
user and the voice acquired from the memory satisfies a 
predetermined reference . 

[0010] In the authentication apparatus, thememorymay hold 

the animated image of the head shots when the user has uttered 
the voice, the image pickup unit may take the head shots of the 
user as the animated image when the voice has been uttered, the 
authentication unit may certify the user by comparing the animated 
image acquired from the memory and the animated image taken by 
the image pickup unit . 

[0011] The authentication apparatus may further comprise 

a microphone for newly acquiring the voice that the user utters, 
and the authentication unit may certify the user by respectively 
selecting frame pictures at the time that the user utters the 
same voice out of the animated image acquired from the memory 
and the animated image taken by the image pickup unit using the 
voice acquired from the memory and the voice acquired by the 
microphone, and comparing the selected frame pictures. 
[ 0012 ] In the authentication apparatus , the authentication 

unit may certify the user by selecting a plurality of combinations 
of the frame pictures at the time that the user utters the same 
voice out of the animated image acquired from the memory and 
the animated image taken by the image pickup unit, and comparing 
the selected plurality of combinations of the frame pictures 
respectively . 



4 



[0013] In the authentication apparatus, the authentication 

unit may include a synchronization unit for synchronizing each 
frame picture of the animated image acquired from the momory 
with the respective frame pictures at the time that the user 
utters the same voice in the animated image taken by the image 
pickup unit using the voice acquired from the memory and the 
voice acquired by the microphone, and certify the user by 
comparing the synchronized frame pictures. 

[0014 ] In the authentication apparatus, the authentication 

unit may certify the user by selecting a plurality of frame 
pictures at the time that the user utters the same voice out 
of each of the animated image acquired from the memory and the 
animated image taken by the image pickup unit, comparing a 
difference between the plurality of frame pictures selected out 
of the animated images acquired from the memory and a difference 
between the plurality of frame pictures selected out of the 
animated images taken by the image pickup unit, and comparing 
the movements of the user's face. 

[0015] In the authentication apparatus, the authentication 

unit may certify the user by respectively cutting a mouth of 
the user from the plurality of frame pictures, calculating and 
comparing the plurality of the differences of the cut user's 
mouth, and comparing a movement oft he user's mouth in the animated 
image acquired from the memory and a movement of the user' s mouth 
in the animated images taken by the image pickup unit. 

[0016] In the authentication apparatus, the memory may 

store head shots at the time that the user does not utter the 
voice, the image pickup unit may take the head shots at the time 
that the user does not utter the voice, the authentication unit 
may certify the user by comparing the head shots acquired from 



the memory and the head shots taken by the image pickup unit 
at the time that the user does not utter the voice. 
[0017] The summary of the invention does not necessarily 

describe all necessary features of the present invention. The 
present invention may also be a sub-combination of the features 
described above. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0018] Fig. 1 is a view showing a use example of an 

authentication apparatus according to an embodiment of the 
present invention . 

[0019] Fig. 2 is a block diagram showing a configuration 

of the authentication apparatus. 

[0020] Fig. 3 is a flowchart showing an authentication 

process performed by the authentication apparatus. 
[0021] Fig. 4 is a flowchart showing the process of S120 

shown in Fig . 3 . 

[0022] Fig. 5 is a flowchart showing the first 

transformation example of the process of S120 shown in Fig. 3. 
[0023] Fig. 6 is a flowchart showing the second 

transformation example of the process of S120 shown in Fig. 3. 
[0024] Fig. 7 is a flowchart showing the third 

transformation example of the process of S120 shown in Fig. 3. 

DETAILED DESCRIPTION OF THE INVENTION 

[0025] The invention will now be described based on the 

preferred embodiments, which do not intend to limit the scope 
of the present invention, but exemplify the invention. All of 
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the features and the combinations thereof described in the 
embodiment are not necessarily essential to the invention. 
[0026] Fig. 1 is a view showing a use example of an 

authentication apparatus 10 according to an embodiment of the 
present invention. The authentication apparatus 10 certifies 
a user based upon a head shot of the user when the user utters 
his password, and for example, is established at an entrance 
of a room. The authentication apparatus 10 allows the user to 
enter the room when the user has been certified. 
[0027] The authentication apparatus 10 is equipped with 

an image pickup unit 12 and a microphone 14 . The image pickup 
unit 12 takes animated images of the user's head shots. In 
addition, the microphone 14 newly acquires voices uttered by 
the user. Thereby, the authentication apparatus 10 takes the 
head shots of the user who hopes to enter the room as the animated 
image using the image pickup unit 12. In addition, the 
authentication apparatus 10 acquires the voices uttered by the 
user using the microphone 14. Then, the authentication 
apparatus 10 selects a frame picture taken at the time that the 
user has uttered the password from among the animated images 
taken by the image pickup unit 12. 

[0028] Moreover, the authentication apparatus 10 holds the 

user' s head shot when the user has uttered the password. Then, 
the authentication apparatus 10 calculates a degree of 
coincidence by comparing the selected frame picture and the 
pre-held user's head shot, and certifies the user. 
[0029] For this reason, the authentication apparatus 10 

can select the frame picture, the countenance of which is 
identical with the countenance of the user's head shot held in 
advance, from among the taken animated images, and use the frame 
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picture for authentication. Therefore, the authentication 
precision of the authentication apparatus 10 is improved. 
[0030] Fig. 2 is a block diagram showing a configuration 

of the authentication apparatus 10. The authentication 
apparatus 10 comprises the image pickup unit 12, the microphone 
14, an input unit 20, a memory 30, a registration information 
acquiring unit 40, and an authentication unit 50, and certifies 
the users using the pictures registered in the memory 30. The 
authentication unit 50 comprises a password authentication unit 
52, a synchronization unit 54, and a face authentication unit 
56. 

[0031] The input unit 20 includes, for example, a touch 

panel, and acquires identification information of the user such 
as a user ID. The memory 30 holds each user's password and the 
voices and animated images of the users when the users have uttered 
the passwords in response to each of the plurality of users' 
identification information. The registration information 
acquiring unit 40 selects and acquires the password, voice and 
head shot from the memory 30 based upon the user' s identification 
information acquired by the input unit 20. The password 
authentication unit 52 decides whether or not the user has uttered 
the password. The synchronization unit 54 synchronizes the 
animated image generated by the image pickup unit 12 with the 
animated image acquired by the registration information 
acquiring unit 40, and respectively selects the frame picture 
which should be use for authentication out of the two animated 
images. The face authentication unit 56 calculates the degrees 
of coincidence of the selected frame pictures, and certifies 
the user. 

[0032] Furthermore, the details of process of each unit 

will be explained below using flowcharts. 
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[0033] Fig. 3 is a flowchart showing an authentication 

process performed by the authentication apparatus 10 . The input 
unit 20 acquires a user's identification information from the 
user (S20) . The registration information acquiring unit 40 
selects the password, voice and head shot which should be utilized 
for authentication based upon the user's identification 
information acquired from the user (S40) . The authentication 
unit 50 requests the user to utter the password (S60) . Thereby, 
the authentication unit 50 lets the user to bring into a 
predetermined action that the user speaks the password. The 
microphone 14 acquires the password that the user has uttered. 
Then, the password authentication unit 52 decides a word spoken 
by the user using voice recognition technique (S80) , and decides 
whether or not the decided word is identical with the password 
acquired from the memory 30 (S100) . If it is identical, the 
synchronization unit 54 and the face authentication unit 56 
perform a face authentication process (S120) . 
[0034] As mentioned above, the authentication apparatus 

10 certifies the user by performing the face authentication only 
when the user has spoken the registered password. In this case, 
the image pickup unit 12 can take the user's head shot with the 
same countenance as that of the head shot stored in the memory 
30, because the user brings into the predetermined action. For 
this reason, the authentication precision for the user is 
increased . 

[0035] In addition, the authentication unit 50 may let the 

user to bring into not only the action that the user speaks the 
password but other actions determined in advance . For example, 
the authentication unit 50 may let the user to bring into the 
predetermined actions such that the user blinks his eyes or puts 
out his tongue. In this case, the authentication unit 50 may 
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lettheuserto bring into the predetermined actions , for example, 
by giving an impetus that induces an action. For example, the 
authentication unit 50 may let the user to blink his eyes by 
shining a flashlight on the user's face. 

[0036] Moreover, in this case, the animated images that 

the user brings into the predetermined actions are registered 
in the memory 30. Then, in step S20, the memory 30 provides 
the user's animated image of bringing into the predetermined 
action to the authentication unit 50. For example, the memory 
30 provides the animated image of the user's head shots that 
the user speaks the password to the authentication unit 50 
according to the identification information receivedby the input 
unit 20. 

[0037] The authentication unit 50 certifies the user by 

comparing the user's animated image taken by the image pickup 
unit 12 during the predetermined action by the user and the 
animated image registered in the memory 30. In this case, the 
authentication unit 50 may certify the user by comparing both 
traces of actions for each part of the face or the body in both 
animated images. Thereby, the user can be certified with high 
precision . 

[0038] Fig. 4 is a flowchart showing the process of S120 

shown in Fig. 3. The synchronization unit 54 synchronizes the 
animated image taken by the image pickup unit 12 with the animated 
image acquired from the memory 30 based upon the user's voices 
at the time that the user has uttered the password (S200) . For 
example, the synchronization unit 54 synchronizes the two 
animated images by confronting a plurality of frame pictures 
of the animated image taken by the image pickup unit 12 with 
the frame pictures at the time that the same voices are uttered 
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out of the animated image acquired from the memory 30 
respectively . 

[0039] The synchronization unit 54 selects the frame 

pictures corresponding to each other, that is, the plurality 
of the frame pictures at the time that the same voices are uttered 
for each of the two synchronized animated images (S220) . 
[0040] The face authentication unit 56 calculates the 

deg rees Q f coincidence between the head shots taken by the image 
pickup unit 12 and the head shots acquired from the memory 30 
for each of the selected frames (S240) . Then, the face 
authentication unit 56 decides for every frame whether or not 
the calculated degrees of coincidence are more than the 
predetermined reference value . When the number of frames having 
the degrees of coincidence greater than the reference value is 
more than the predetermined reference number (S260: Yes), the 
face authentication unit certifies the user (S280) . 
[0041] According to this flowchart, even if the degree of 

coincidence in the specified frame picture is extremely low, 
for example, by having a countenance other than usual countenance 
when the user utters the password, the authentication apparatus 
10 can certify the user in case that the degree of coincidence 
for the other frame picture is high. 

[0042] Furthermore, the synchronization unit 54 may accord 

both playback times when the user utters the password by expanding 
and contracting play back time of either side out of the animated 
image taken by the image pickup unit 12 and the animated image 
acquired from the memory 30, in step S200. In this case, the 
synchronization unit 54, for example, conforms frame rates of 
both the animated images by thinning out or interpolating the 
frames of either animated image. 
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[0043] Moreover, the face authentication unit 56 may 

certify the user when an average of the plurality of the calculated 
degrees of coincidence is greater than the predetermined 
reference value, in S260 and S280. At this time, the face 
authentication unit 56 may also calculate the average using 
several higher degrees of coincidence, rather than calculate 
the average using all the degrees of coincidence. 
[0044] Fig. 5 is a flowchart showing the first 

transformation example of the process of S120 shown in Fig. 3. 
In this example, the face authentication unit 56 certifies the 
user based upon a user's mouthing when the user utters the 
password . 

[0045] The synchronization unit 54 synchronizes the 

animated image taken by the image pickup unit 12 with the animated 
image acquired from the memory 30 based upon a user' s voice when 
the user utters the password (S300) . Explanation of this process 
will be omitted because the process is identical with that of 
S200 in Fig. 4. 

[0046] Next, the face authentication unit 56 cuts mouths 

from the frame pictures of the animated image taken by the image 
pickup unit 12 and the frame pictures of the animated image 
acquired from the memory 30 (S310) . The face authentication 
unit 5 6 calculates the difference between the mouths' appearances 
for the frame pictures backward and forward in each of the animated 
image taken by the image pickup unit 12 and the animated image 
acquired from the memory 30 (S320) . Then, the face 
authentication unit 56 calculates degrees of coincidence of the 
differences for the animated image taken by the image pickup 
unit 12 and the differences for the animated image acquired from 
the memory 30 (S330) . 
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[0047] In addition, the difference for the mouth's 

appearance is, for example, the difference for the frame pictures 
backward and forward corresponding to the distance between a 
plurality of distinguishing marks obtained from the mouth's 
appearances. The face authentication unit 56 calculates, for 
example, the difference for the distance between both left and 
right ends on a lip. The face authentication unit 56 calculates 
variation of the distances for the frame pictures backward and 
forward as this difference. The face authentication unit 56 
may also calculate the difference for the upward and downward 
width on the lip. 

[0048] Furthermore, the face authentication unit 56 may 

also calculate the difference of the mouth's appearances for 
an interval between an utterance of the predetermined reference 
voice by the user and an utterance of the other predetermined 
reference voice. For example, the face authentication unit 56 
may calculate the difference of the mouth's appearances for an 
interval from when the user utters a first voice of the password 
to when the user utters next voice or final voice of the password. 

[0049] The face authentication unit 56 decides every 

difference whether or not the calculated degree of coincidence 
is greater than the predetermined reference value (S340) . If 
the number of the differences greater than the reference value 
is more than the predetermined reference number (S340: Yes), 
the face authentication unit certifies the user (S350) . 

[0050] Here, the mouth is a part moving most severely in 

the user ' s face when he speaks a word . This movement is different 
from every person. Therefore, when someone else behaves like 
the user, the degree of coincidence of the mouth's movement is 
decreased in comparison with the case that the degree of 
coincidence of the overall face is calculated. Accordingly, 



13 



the authentication apparatus 10 can certify the user with high 
precision. In addition, the face authentication unit 56 may 
also calculate a trace of the specified part of the mouth, and 
certify the user based upon the degree of coincidence of this 
trace . 

[0051] As explained above, in this transformation example, 

the authentication unit 50 certifies the user by comparing the 
user's movement in the animated image taken by the image pickup 
unit 12 and the user's movement in the animated image acquired 
from the memory 30. In addition, the authentication unit 50 
may also certify the user by comparing the movement of the head 
shots taken by the image pickup unit 12 and the movement of the 
head shots acquired from the memory 30, not the movement of the 
mouth. For example, the authentication unit 50 may also certify 
the user by comparing the movements of each part of the face 
in the animated images for both side' s head shots . Furthermore, 
the authentication unit 50 may also certify the user by comparing, 
for example, variations of the head shots such as variation of 
the intervals between both eyes or variation of the lengths of 
nose's lower part. 

[0052] Fig. 6 is a flowchart showing the second 

transformation example of the process of S120 shown in Fig. 3. 
In this example, the face authentication unit 56 certifies the 
user based upon the head shot at the time that the user does 
not make a voice . In addition, when performing the process shown 
in Fig. 6 in S120, the authentication apparatus 10 may not also 
include the synchronization unit 54. Moreover, the memory 30 
may hold still pictures at the time that the user does not make 
a voice in response to the user's identification information. 
[0053] The face authentication unit 56 selects a plurality 

of frame pictures at the time that the user does not make a voice 
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from among the animated images taken by the image pickup unit 
12 based upon the voices acquired by the microphone 14 (S400) . 
Then, the face authentication unit 56 calculates the degrees 
of coincidence between each of the selected frame pictures and 
the still picture of the face held by the memory 30 (S410) . When 
any degree of coincidence is more than the predetermined reference 
value (S420: Yes), the face authentication unit 56 certifies 
the user (S430) . 

[0054 ] As mentioned above, in this transformation example, 

the memory 30 holds the still pictures of the faces at the time 
that the users do not make voices. In addition, the face 
authentication unit 56 selects the frame pictures at the time 
that the users do not make voices from among the animated images 
taken by the image pickup unit 12. For this reason, the 
possibility that the user' s countenance in the head shot held 
by the memory 30 accords with the user' s countenance in the frame 
picture selected out of the taken animated images becomes high. 
Therefore, if the authentication process is performed by this 
transformation example, the user can be certified with high 
precision . 

[0055] Furthermore, in S420 and S430, the face 

authentication unit 56 may certify the user when the number of 
the frames having the degree of coincidence greater than the 
predetermined reference value is more than the predetermined 
reference number. Moreover, the face authentication unit 56 
may certify the user when the average of the calculated degrees 
of coincidence is greater than the predetermined reference value . 

[0056] Furthermore, in this transformation example, the 

authentication unit 50 detects the status that the user does 
not make a voice using the user' s voice detected by the microphone 
14. Then, the authentication unit 50 certifies the user based 
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upon the head shot at the time that the user does not make a 
voice. In this case, the authentication unit 50 directs the 
user to make a posture that the user does not make a voice as 
the predetermined posture. 

[0057] Fig. 7 is a flowchart showing the third 

transformation example of the process of S120 shown in Fig. 3. 
The authentication unit 50 certifies the user based upon both 
of the degree of coincidence of the head shot and the degree 
of coincidence of the voice. 

[0058] The synchronization unit 54 synchronizes the 

animated image taken by the image pickup unit 12 with the animated 
image acquired from the memory 30 based upon the user's voice 
at the time that the user utters the password (S500) . Thisprocess 
is the same that of S200 shown in Fig. 3. 

[0059] Then, the synchronization unit 54 selects the frame 

pictures corresponding to each other , that is, the frame pictures 
at the time that the same voices are uttered out of each of the 
two synchronized animated images (S510) . 

[0060] The face authentication unit 56 calculates the 

degree of coincidence between synchronizes the head shot taken 
by the image pickup unit 12 and the head shot acquired from the 
memory 30 for each of the selected frames (S520) . Then, the 
face authentication unit 56 calculates the average of the upper 
degrees of coincidence among the calculated degrees of 
coincidence as the degree of coincidence for the head shot (S530) . 
In addition, the authentication unit 50 calculates the degree 
of coincidence between the user's voice held in the memory 30 
and the user's voice acquired by the microphone 14 (S540). 
[0061] Then, when the degree of coincidence calculated by 

the face authentication unit 56 and the degree of coincidence 
for the voice satisfy a predetermined reference (S550: Yes), 
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the authentication unit 50 certifies the user (S560) . The 
predetermined reference means that a point defined by the degree 
of coincidence of the head shot and the degree of coincidence 
of the voice is in the range of the predetermined region in 
two-dimensional space defined by the degree of coincidence of 
the head shot and the degree of coincidence of the voice. The 
predetermined region is the region that satisfies, for example, 
the requirement that both of the degree of coincidence of the 
head shot and the degree of coincidence of the voice are more 
than the reference value. 

[0062] According to the invention, the authentication 

apparatus 10 certifies the user based upon the degree of 
coincidence of the user' s voice and the degree of coincidence 
of the frame picture synchronized by the voice. Therefore, the 
user can be certified with high precision. 

[0063] As explained above, according to the present 

invention, the apparatus can match the user's face countenance 
during the authentication to the countenance of the user's head 
shot held in advance. Therefore, the user can be certified with 
high precision . 

[0064] Although the present invention has been described 

by way of an exemplary embodiment, it should be understood that 
those skilled in the art might make many changes and substitutions 
without departing from the spirit and the scope of the present 
invention. It is obvious from the definition of the appended 
claims that embodiments with such modifications also belong to 
the scope of the present invention. 



